The aim of this report is to analyze vehicle accidents in the context of severity of injuries. Database used in the analysis includes information concering motor vehicle crashes within the City of Somerville from 01.01.2010 to 30.04.2018. Based on the original database two datasets were created: crashes and crashes_gender. This two datasets contain two common variables: injury and meansev. Second dataset (crashes_gender) was created separately because it was easier for the author to transform the original form of the variable contribuition in the smaller dataset. Let’s have a look at the both datasets.
Dataset consist of 4033 observations and 10 variables. Description of the variables:
| nb_of_vehicles | age | injury | meansev | |
|---|---|---|---|---|
| Min. :1.000 | Min. :12.0 | Min. :0.0000 | Min. :1.000 | |
| 1st Qu.:1.000 | 1st Qu.:27.0 | 1st Qu.:0.0000 | 1st Qu.:5.000 | |
| Median :2.000 | Median :36.0 | Median :0.0000 | Median :5.000 | |
| Mean :1.844 | Mean :40.3 | Mean :0.2137 | Mean :4.678 | |
| 3rd Qu.:2.000 | 3rd Qu.:51.5 | 3rd Qu.:0.0000 | 3rd Qu.:5.000 | |
| Max. :5.000 | Max. :96.0 | Max. :1.0000 | Max. :5.000 | |
| NA’s :2018 | NA’s :3290 | NA’s :594 | NA’s :596 |
Dataset consist of 752 observations (because of missing values in variable gender) and 4 variables.
Description of the variables:
This one presents accidents with or without injuries. The color on the map reflects whether anyone was injured in the accident or not. The size of the points reflects number of vehicles involved in the accident. We conclude that in most of the accidents no one was injured and there were at least two cars involved.
These maps describe severity of injuries. We can see that there was only one fatal accident with one car involved and the less severe injury the more accidents. We can also conclude that the most severe injuries which is incapacitating injury and fatal injury took place on the suburbs of the city. The reason for this may be that in the suburbs of the city the permitted speed is higher and as a consequence the accidents are more severe.
This graph presents the change in number of accidents through years 2013 and 2017 and also the ratio of accidents with injuries and without. The analyzed period has been shortened because of the fact that data concerning injuries was available from 2013 and we have incomplete data for 2018. This graph confirms that there were more accidents with no injuries. The overall number of accidents dropped between 2016 and 2017. The number of accidents without injuries varied more over time. Meanwhile the ones with some injuries were more or less stable in time.
This graph presents the change in number of accidents with injuries. We can see that number of accidents with possible injuries varied the most over time and there were more accidents of this kind. On the other hand number of accidents with incapacitating injuries hardly changed over time and there were less accidents of this kind.
This graph presents the change in number of accidents with injuries from 2013 to 2017. This confirms earlier conclusions because, as we can see, accidents with possible injuries dropped significantly while the ones with incapacitating injuries just slightly.
From this plot we can conclude that the highest number of accidents overall took place in March, January, May and December. But in case of March and January most of them were the accidents without any injuries. The highest number of accidents with injuries happened in June and May. This could be due to the fact that in winter and early spring the weather conditions are worse than in summer so most of the drivers are steering vehicles more carefully and as a consequence there are more accidents but there are not severe. The reason for large number of accidents in June and May may be that people go on vacation during these months and as a result there are more cars on the road. Additionally, good weather conditions encourage drivers to drive faster. All of this can lead to serious accidents.
The majority of accidents are the ones with possible injuries. We can see that the highest number of accidents with incapacitating injuries took place in May, November and August. High number of this kind off accidents also took place in February and October. Fatal accident took place in June. So as we can see most severe injuries took place in the summer, late autumn and winter. The reason for severe accidents in summer could be as mentioned before. In autumn and winter this could be due to the fact that some drivers are not steering their vehicles carefully despite bad weather conditions.
This plot presents number of accidents by the day of the week. We can conclude that the highest number of accidents both with and without injuries happened on Fridays. There were also a lot of accidents on Tuesdays and Thursdays. On Mondays there were also many accidents but most of them occurred without any injuries. The reason for that many accidents on Friday could be that it is the beginning of the weekend and people go on longer journeys. So there is more vehicles on the road and as a consequence there are more accidents overall and there are more severe accidents. The reason for the high number of accidents with no injuries on Mondays may be that people are in a rush to work because they overslept after the weekend.
This plot presents the same analysis but by the severity of injury. We can see that most often the accidents with incapacitating injuries happen on Fridays, Tuesdays and Thursdays. We can also see that a fatal accident took place on Thursday. The reason for more severe accidents on Fridays could be as above. People are going on longer journeys. Outside of the center of the city maximum authorized speed is greater. Combined with increased traffic on the road, this can lead to serious accidents.
These plots present changes in number of accidents throughout the day. In case of overall analysis which includes accidents with and without injuries the highest number of accidents took place between 7 a.m. and 8 a.m. and there are a lot of accidents throughout the working hours. In the case of analysis of accidents with some level of injuries we can see that the highest number of accidents took place between 8 a.m. and 9 a.m. There was also many accidents between 7 a.m. and 8 a.m., 2 p.m. and 3 p.m. and 4 p.m. and 5 p.m. It’s not surprising that the most serious accidents occur at time when people are driving to work and from work because in this hours there are more vehicles on the roads.
We can see that 67% of women and 63% of men cause accidents. So we can conclude that mostly women causes accidents but the difference is only 4 percentage points.
From this graph we can conclude that men cause more accidents with incapacitating injuries and non-incapacitating injuries while women cause more accidents with possible injuries.
The graph above presents distribution of age by severity of injuries. The analysis covers the age of 16 and over. We can see that in all cases the highest number of people involved in the accidents are people around age of thirty. In case of incapacitating injuries and non-incapacitating injuries we can also see that there are many people around age of fifty that were involved in the accidents but this is more evident in case of incapacitating injuries. In case of large number of accidents with injuries which involve people around age of thirty the reason may be that people at this age are more mobile and they can afford their own cars so there are more people at this age on the roads and as a consequence there are more accidents which involve this people. The reason for rising number of accidents in case of incapacitating injuries and non-incapacitating injuries which involves people around age of fifty could be that people at this age are more vulnerable so if they are involved in the accident the propability of severe injuries rises.
At this graph distribution of age was presented in the form of a histogram in order to show age of the person with fatal injuries. We can see that the person was 90 years old.
From the lolipop chart we can see that the highest number of accidents happened while driving passenger car. A lot of accidents also includes driving a light truck and a single unit track.
Based on this graph we can conclude that the most common accidents were these with no injuries involving passenger cars and light trucks. We can also see that in case of big and small buses, tractors, trailers and unknown heavy trucks there are only accidents with no injuries. In case of incapacitating injuries the most common vehicle types involved in the accidents were passenger car and light truck but there was also case with motorcycle and two with a single unit track. In the accident which casused fatal injury was involved a single unit track.